rank | frequency | n-gram |
---|---|---|
1 | 13568 | -e |
2 | 6207 | -d |
3 | 3762 | -h |
4 | 3317 | -r |
5 | 3216 | -t |
rank | frequency | n-gram |
---|---|---|
1 | 3504 | -ch |
2 | 2647 | -er |
3 | 2560 | -de |
4 | 2438 | -he |
5 | 1423 | -ng |
rank | frequency | n-gram |
---|---|---|
1 | 2312 | -che |
2 | 1911 | -sch |
3 | 1035 | -ung |
4 | 1004 | -chd |
5 | 664 | -nge |
rank | frequency | n-gram |
---|---|---|
1 | 1676 | -sche |
2 | 1326 | -isch |
3 | 730 | -schd |
4 | 529 | -chde |
5 | 372 | -unge |
rank | frequency | n-gram |
---|---|---|
1 | 1260 | -ische |
2 | 467 | -schde |
3 | 392 | -lisch |
4 | 255 | -ischd |
5 | 206 | -disch |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings